monster ai model
In AI, You Want to Be a Jazz Band
As I continue working on exciting research in Artificial Intelligence, I like making parallels with other areas of science and life in general. I think there is a perfect metaphor from music that explains the AI market. Since helping people fight their health problems is my passion, I'll focus on AI in healthcare. The AI market is currently overwhelmingly at the two opposite extremes -- a high school band and a 7,500-person orchestra. Both extremes are perfectly acceptable and have their audiences. However, there is not much in the middle.
GPT 3 and Monster AI Models: What is in Store for the Future?
GPT-3 or Generative Pre-trained Transformer 3 is a language model that was created by OpenAI, an artificial intelligence research laboratory in San Francisco. The 175-billion parameter deep learning model is capable of producing human-like text and was trained on large text datasets with hundreds of billions of words. When OpenAI released GPT-3, in June 2020, the neural network's apparent grasp of the language was uncanny. It could generate convincing sentences, converse with humans, and even autocomplete code. GPT-3 was also monstrous in scale--larger than any other neural network ever built.
2021 was the year of monster AI models
What does it mean for a model to be large? The size of a model--a trained neural network--is measured by the number of parameters it has. These are the values in the network that get tweaked over and over again during training and are then used to make the model's predictions. Roughly speaking, the more parameters a model has, the more information it can soak up from its training data, and the more accurate its predictions about fresh data will be. GPT-3 has 175 billion parameters--10 times more than its predecessor, GPT-2.